Providing composite graphical assistant interfaces for controlling various connected devices

ABSTRACT

Methods, apparatus, systems, and computer-readable media are provided for tailoring composite graphical assistant interfaces for interacting with multiple different connected devices. The composite graphical assistant interfaces can be generated in response to a user providing a request for an automated assistant to cause a connected device to perform a particular function. In response to the automated assistant receiving the request, the automated assistant can identify other connected devices, and other functions capable of being performed by the other connected devices. The other functions can then be mapped to various graphical control elements in order to provide a composite graphical assistant interface from which the user can interact with different connected devices. Each graphical control element can be arranged to reflect how each connected device is operating simultaneous to the presentation of the composite graphical assistant interface.

BACKGROUND

Humans may engage in human-to-computer dialogs with interactive software applications referred to herein as “automated assistants” (also referred to as “digital agents,” “chatbots,” “interactive personal assistants,” “intelligent personal assistants,” “assistant applications,” “conversational agents,” etc.). For example, humans (which when they interact with automated assistants may be referred to as “users”) may provide commands and/or requests to an automated assistant using spoken natural language input (i.e. utterances), which may in some cases be converted into text and then processed, and/or by providing textual (e.g., typed) natural language input.

Some automated assistants can be used to control internet of things (“IoT”) devices. However, many IoT devices can come with their own corresponding applications. As a result, a user that has acquired multiple different IoT devices may be tasked with installing a variety of third party applications on their personal computing devices. Because applications can typically require memory, network bandwidth, updates, and/or other resources, having a variety of similar applications on a single device can be inefficient. Although a user may rely on an automated assistant to mediate interactions between the user and a particular IoT device, such interactions can frequently require spoken utterances from the user to be converted from audio to text. In order to effectuate conversion of audio to text, audio data oftentimes must be sent over a network, thereby consuming significant network bandwidth. Furthermore, mediating interactions in such a way can prove unreliable when a user is surrounded by other people that are talking, or is otherwise in environment with background noise. As a result, a dialog session between a user and an automated assistant may waste computational resources and network resources when requests for various IoT devices to perform particular functions are ultimately not identified, or otherwise not fulfilled.

SUMMARY

The present disclosure is generally directed to methods, apparatus, and computer-readable media (transitory and non-transitory) for providing composite graphical assistant interfaces through which to control multiple different devices. The composite graphical assistant interface can be used to control a variety of different devices, despite the different devices being manufactured by different third parties. In this way, a user does not necessarily have to install multiple different third party applications to control their respective IoT devices. Rather, a user can rely on their automated assistant to recognize particular contexts in which a user would like to control a particular IoT device, and to provide a composite graphical assistant interface that includes suggestions for particular IoT devices to control.

For example, a user can enter their kitchen, which can include a variety of different devices, such as a client device that provides access to an automated assistant, and various connected devices that are in communications with the client device. The connected devices can include a stereo system, an oven, smart lights, and a coffeemaker. Each of the connected devices can be manufactured by different third party manufacturers, at least relative to the manufacturer of the automated assistant. Typically in the morning when the user enters the kitchen, the user can provide a command such as, “Assistant, turn on the lights in the kitchen.” The user may then access a variety of applications or manually interact with the connected devices in order to cause other functions to be performed. Although the user could verbally initialize the automated assistant to cause each function to be performed (e.g., “Assistant, turn on the coffeemaker . . . Assistant, play some music”), this would necessitate additionally speech processing, thereby consuming computational and network resources. Furthermore, if the user was required to access a different application to control each device, each application may require some amount of network connectivity, thereby further inefficiently consuming computational and network resources. In order to provide a more efficient use of such resources, the automated assistant can provide a composite graphical assistant interface in response to receiving an initial request to perform a function and/or in response to a user selection at a particular assistant interface.

For example, in response to receiving a spoken utterance such as, “Assistant, turn on the kitchen lights,” the automated assistant can identify a current status of other connected devices within the kitchen in order to identify particular functions to recommend to the user. For instance, if the coffeemaker in the kitchen is determined to be on at the time the spoken utterance is received, the automated assistant can omit any reference to the coffeemaker in the composite graphical assistant interface being compiled. In this way, the automated assistant would not be making a redundant suggestion, and can make room on the composite graphical assistant interface for more suitable suggestions. Additionally, if the automated assistant determines that the stereo system in the kitchen is not currently playing music, the automated assistant can determine a function to suggest to the user for controlling the stereo system. For instance, the automated assistant can provide a composite graphical assistant interface that includes a graphical control element for turning on the stereo system and/or changing a volume of the stereo system. The composite graphical assistant interface can be provided at a display panel that is integral to the client device, or at a display panel that is associated with the client device (e.g., a tablet device that is also located in the kitchen). In this way, the user has a less computationally intensive modality through which to turn on the stereo system, and the modality can be provided to the user in a timely manner in response to their initial spoken utterance.

In some implementations, a suggestion for a particular function to be performed by a connected device can be based on data that indicates a historical inclination of the user to interact with particular connected devices within particular contexts. For example, the user can have a historical inclination to turn on their stereo system when their kitchen lights are on, or shortly after their kitchen lights are requested to be on. When such a context subsequently arises, the automated assistant can provide a composite graphical assistant interface that includes a graphical control element for controlling the stereo system at a display panel within the kitchen, or at another suitable display panel. Furthermore, the automated assistant can identify other connected devices that are associated with the user, the request from the user, the context of the request, and/or any other information associated with the user, in order to provide further suggestions at the composite graphical assistant interface. For instance, the user can also have a historical inclination to pre-heat their kitchen oven at night, after turning on the kitchen lights and the music from the stereo system. When such a context subsequently arises, the automated assistant can provide a composite graphical assistant interface that includes a graphical control element for controlling one or more functions of their oven, such as a bake temperature and a timer setting, as well as graphical control elements controlling features of their kitchen lights and/or stereo system. In this way, the composite graphical assistant interface generated by the automated assistant can provide controls for a variety of different connected devices that are made by a variety of different manufacturers.

The above description is provided as an overview of some implementations of the present disclosure. Further description of those implementations, and other implementations, are described in more detail below.

In some implementations, a method implemented by one or more processors is set forth as including operations such as determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for a connected device to perform a function. The connected device can be linked with the client device and the spoken utterance is provided by a user. The method can further include identifying, in response to receiving the request, a function that corresponds to the received request; causing, based on identifying the function, the connected device to perform the function corresponding to the received request; and determining, by an automated assistant in response to receiving the request, another function for recommending to the user based on data accessible to the automated assistant. The other function is different than the identified function and is capable of being performed by another connected device. The method can also include identifying, based on determining the other function, a graphical control element for controlling the other function of the other connected device; and causing, in response to receiving the request, a composite graphical assistant interface to be provided at a display panel associated with the client device, the composite graphical assistant interface comprising the identified graphical control element. The method can further include determining that a selection of the graphical control element was received at the composite graphical assistant interface; and causing, in response to determining the selection of the graphical control element was received, the other connected device to perform the other function.

In some implementations, the data identifies a current status of the other connected device and the current status is caused to change to a different status in response to the selection of the graphical control element. In some implementations, determining the other function includes: determining, based on the data, that a particular function at least partially effectuated the current status of the other connected device; and selecting the other function for recommending to the user, based at least in part on the particular function being different than the other function. In some implementations, determining the other function further includes: determining one or more slot values of functions that at least partially effectuated the current status of the other connected device; and identifying one or more parameters of one or more candidate functions capable of affected the current status of the connected device. Selecting the other function for recommending to the user, at least based on the particular function being different than the other function can include selecting the other function from the one or more candidate functions according to whether the other function includes a parameter that is different than another parameter of the particular function.

In some implementations, the method can include determining a speakable command that, when provided to the automated assistant interface, initializes the automated assistant to cause the other function of the other connected device to be performed. Causing the composite graphical assistant interface to be provided at the display panel can include causing the display panel to show the speakable command.

In some implementations, determining another function for recommending to the user based on data accessible to the automated assistant includes: identifying one or more additional connected devices, from a plurality of connected devices, based on a stored device topology that indicates a relationship of the one or more additional connected devices to the client device or the connected device; and identifying one or more additional functions capable of being performed by the one or more additional connected devices. The other function can be determined from the identified one or more additional functions. In some implementations, the data can indicate a status of the one or more additional connected devices that is based on a previous performance or nonperformance of the other function, and a configuration of the graphical control element at the composite graphical assistant interface can be based on the indicated status of the one or more additional connected devices.

In other implementations, a method implemented by one or more processors, is set forth as including operations such as determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for an automated assistant to perform a function, wherein the spoken utterance is provided by a user. The method can also include identifying, in response to determining the request was provided by the user, a function that corresponds to the request provided by the user; and causing, based on identifying the function, the automated assistant to perform the function that corresponds to the request provided by the user. The method can further include determining a first current status of a first connected device that is linked to the client device. The first current status of the connected device can be affected by performance or nonperformance of a first function. The method can also include determining a second current status of a second connected device that is linked to the client device. The second current status of the second connected device can be affected by performance or nonperformance of a second function. The method can further include determining a first graphical control element for controlling the first function of the first connected device and a second graphical control element for controlling the second function of the second connected device; and determining, based on the first current status of the first connected device and the second current status of the second connected device, a configuration for each graphical control element of the first graphical control element and the second graphical control element. The method can also include causing, in response to determining the request was received, a display panel associated with the client device to provide a composite graphical assistant interface for controlling the first connected device and the second connected device. The composite graphical assistant interface can include the first graphical control element and the second graphical control element, which are arranged according to the determined configuration for each graphical control element.

In some implementations, the method can include accessing a profile for the user based on a voice characteristic associated with the spoken utterance. Identifying at least one function of the first function and the second function can be based on information available from the profile. In some implementations, the profile can indicate a historical inclination of the user to interact with at least one connected device of the first connected device and the second connected device within a particular context. In some implementations, the particular context can correspond one or more of a status of the first connected device or the second connected device, a geographic location of the user, a time, an event, or a presence of one or more other users.

In some implementations, determining a first current status of the first connected device can include requesting, from a remote server device, the first current status and command data associated with the first function. The first function can be different than the function performed by the automated assistant. In some implementations, the method can include receiving a selection of the first graphical control element for controlling the first function; and causing the client device to interact with the first connected device according to the command data associated with the first function.

In yet other implementations, a method implemented by one or more processors is set forth as including operations such as determining that a user has provided a request for an automated assistant to perform a function. The request can be received at an automated assistant interface of a client device. The method can further include causing, in response to determining that the user has provided the request, the automated assistant to perform the function that corresponds to the request; and determining, in response to determining that the user has provided the request, a user preference that characterizes a historical inclination of the user to control another function of a connected device in a context associated with the function performed by the automated assistant. The connected device can be linked to the client device. In some implementations, the method can include identifying, based on determining the user preference, the connected device and the other function of the connected device; and determining, based on identifying the connected device, a current status of the connected device. The current status can be affected by performance or nonperformance of the other function. The method can also include determining, based on identifying the other function and the current status of the connected device, a graphical control element for controlling the other function of the connected device. The graphical control element can indicate a capability of the graphical control element to cause the other function to be performed by the connected device resulting in the current status of the connected device changing to a different status. The method can further include causing, in response to determining the user has provided the request, a display panel associated with the client device to provide a composite graphical assistant interface that includes the graphical control element.

In some implementations, the method can include determining, based on the user preference, a selectable suggestion element associated with a separate connected device of the connected devices. Causing the display panel to provide the composite graphical assistant interface can include causing the composite graphical assistant interface to include the selectable suggestion element. In some implementations, the method can include determining that a selection of the selectable suggestion element was received at the composite graphical assistant interface; and causing one or more graphical control elements to be presented at the display panel for controlling one or more functions of the separate connected device. In some implementations, the method can include determining a speakable command that, when received at the automated assistant interface of the client device, invokes the automated assistant to cause the separate connected device to perform the one or more functions of the separate connected device. Causing the display panel associated with the client device to provide the composite graphical assistant interface can include causing the composite graphical assistant interface to include a natural language representation of the speakable command.

In some implementations, the method can include determining, in response to causing the connected device to perform the function, a resulting status of the connected device. The resulting status can be affected by performance of the function. Furthermore, causing the display panel associated with the client device to provide the composite graphical assistant interface can include causing the composite graphical assistant interface to include a graphical interface element that indicates the resulting status of the connected device.

In some implementations, the user preference characterizes the historical inclination of the user to control the other function based at least on a previous interaction between the user and the automated assistant. In some implementations, the previous interaction corresponds to a dialog session between the user and the automated assistant, prior to the user selecting the graphical control element. During the dialog session, the user would have provided a request to cause the connected device to perform the other function.

In other implementations, a method implemented by one or more processors is set forth as including operations such as determining that a user has provided a request for an automated assistant to perform a function. The request can be received at an automated assistant interface of a client device. The method can also include causing, in response to determining that the user has provided the request, the automated assistant to perform the function that corresponds to the request; and determining, in response to determining that the user has provided the request, a user preference that characterizes a historical inclination of the user to control another function of a connected device in a context associated with the function performed by the automated assistant. The connected device can be linked to the client device. In some implementations, the method can include identifying, based on determining the user preference, the connected device and a desired status for the connected device; and determining, based on identifying the connected device, a current status of the connected device. The current status can be affected by performance or nonperformance of the other function. The method can further include, when the desired status is different than the current status of the connected device: determining, based on identifying the other function and the current status of the connected device, a graphical control element for controlling the other function of the connected device. The graphical control element can indicate a capability of the graphical control element to cause the other function to be performed by the connected device resulting in the current status of the connected device changing to the desired status. Furthermore, the method can include causing, in response to determining the user has provided the request, a display panel associated with the client device to provide a composite graphical assistant interface that includes the graphical control element.

In some implementations, the method can include, when the desired status is equivalent to the current status of the connected device: causing, in response to determining the user has provided the request, a display panel associated with the client device to provide a particular composite graphical assistant interface that omits a particular graphical control element for controlling the other function of the connected device.

Other implementations may include a non-transitory computer readable storage medium storing instructions executable by one or more processors (e.g., central processing unit(s) (CPU(s)), graphics processing unit(s) (GPU(s)), and/or tensor processing unit(s) (TPU(s)) to perform a method such as one or more of the methods described above and/or elsewhere herein. Yet other implementations may include a system of one or more computers and/or one or more robots that include one or more processors operable to execute stored instructions to perform a method such as one or more of the methods described above and/or elsewhere herein.

It should be appreciated that all combinations of the foregoing concepts and additional concepts described in greater detail herein are contemplated as being part of the subject matter disclosed herein. For example, all combinations of claimed subject matter appearing at the end of this disclosure are contemplated as being part of the subject matter disclosed herein.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a system for providing composite graphical assistant interfaces for interacting with connected devices.

FIG. 2A and FIG. 2B illustrate views of a composite graphical assistant interface being provided with multiple different graphical control elements for controlling multiple different connected devices.

FIG. 3A and FIG. 3B illustrate views of a user being provided suggestions for controlling one or more connected devices based on an interaction between the user and an automated assistant, and/or a current status of one or more connected devices.

FIG. 4A and FIG. 4B illustrate methods for providing a composite graphical assistant interface that includes suggestions for functions capable of being performed by multiple different connected devices.

FIG. 5 illustrates a method for supplanting a composite graphical assistant interface with graphical control elements for controlling a connected device.

FIG. 6 is a block diagram of an example computer system.

DETAILED DESCRIPTION

FIG. 1 illustrates a system 100 for assembling composite graphical assistant interfaces for interacting with connected devices. A composite graphical assistant interface can be generated in part by an automated assistant 118 that is provided at one or more computing devices, such as a client device 102 (e.g., an assistant device 128), an additional client device (e.g., a tablet device 150) and/or a server device 112. A user 120 can interact with a client automated assistant 104 via an assistant interface 110, which can be a microphone, a camera, a touch screen display, a user interface, and/or any other apparatus capable of providing an interface between a user and an application. For instance, a user 120 can initialize the client automated assistant 104 by providing a verbal, textual, and/or a graphical input to the assistant interface 110, and/or the tablet device 150, to cause the client automated assistant 104 to perform an operation (e.g., provide data, control a peripheral device, access an agent, etc.). The client device 102 can include a display device, which can be a display panel that includes a touch interface for receiving touch inputs and/or gestures for allowing a user to control applications of the client device 102 via the touch interface. Alternatively, the client device 102 can be an assistant device 128 that does not include a display panel, but nonetheless provides one or more interfaces through which the user 120 can interact with the client automated assistant 104 (e.g., through spoken utterances provided by the user 120 to a microphone at the assistant device 128).

The client device 102 can be in communication with the server device 112 over a network 122, such as the internet. The client device 102 can offload computational tasks to the server device 112 in order to conserve computational resources at the client device 102 and/or in order to leverage more robust resources at the server device 112. For instance, the server device 112 can host the automated assistant 118, and the client device 102 can transmit inputs received at one or more assistant interfaces to the server device 112. However, in some implementations, the automated assistant 118 can be entirely hosted at the client device 102. In various implementations, all or less than all aspects of the client automated assistant 104 can be implemented on the client device 102. In some of those implementations, aspects of the automated assistant 118 are implemented via the client automated assistant 104 of the client device 102 and interface with the server device 112, which implements other aspects of the client automated assistant 104. The server device 112 can optionally serve a plurality of users and their associated assistant applications via multiple threads. In some implementations where all or less than all aspects of the automated assistant 118 are implemented via a client automated assistant 104 of the client device 102, the client automated assistant 104 can be an application that is separate from an operating system of the client device 102 (e.g., installed “on top” of the operating system)—or can alternatively be implemented directly by the operating system of the client device 102 (e.g., considered an application of, but integral with, the operating system).

In some implementations, the server device 112 can include a voice to text engine 116 that can process audio data received at an assistant interface 110 to identify the text embodied in the audio data. The process for converting the audio data to text can include a speech recognition algorithm, which can employ neural networks, word2vec algorithms, and/or statistical models for identifying groups of audio data corresponding to words or phrases. The text converted from the audio data can parsed by a text parser engine 108 and made available to the automated assistant 104 as textual data that can be used to generate and/or identify command phrases from the user and/or a third party application.

The user 120 can interact with a first connected device 132 and/or a second connected device 146 using their client device 102 and/or client automated assistant 104. Each connected device can be one or more IoT devices, which can be connected to or in communication with the client device 102 and/or the tablet device 150 over a local area network and/or a wide area network. For instance, the client device 102, the tablet device 150, the first connected device 132, and/or the second connected device 146 can be connected to a Wi-Fi network in a home or other environment. In order for the user 120 to interact with the first connected device 132, the user 120 can initialize an application associated with the first connected device 132. However, if the user 120 frequently interacts with multiple different connected devices, the user 120 may be required to operate a variety of different applications in order to control each connected device. In order to more efficiently use computational resources, and prevent wasting network bandwidth, the client automated assistant 104 and/or the automated assistant 118 can provide a composite graphical assistant interface from which the user can interact with one or more different connected devices (e.g., the first connected device 132 and the second connected device 146).

In order for the composite graphical assistant interface to be generated and/or provided, the user 120 can provide a spoken utterance 130 to the assistant interface 110. The spoken utterance can embody a request that indicates the user 120 would like the first connected device 132 to perform a particular function. Audio data corresponding to the spoken utterance 130 can be transmitted from the client device 102 to the server device 112. The audio data can then be processed by the voice to text engine 116 and by the text parser engine 114. Text resulting from the processing of the audio data can be used by a function identifying engine 142 in order to identify one or more functions with which the user 120 is referring to in their spoken utterance 130. For instance, the function identifying engine 142 can compare the text from the spoken utterance 130 to function data 136 made available at a connected device server 134. The function data 136 can provide a comprehensive list of functions that one or more connected devices (e.g., the first connected device 132 and the second connected device 146) are capable of performing. The function identifying engine 142 can compare the text to the function data 136 in order to identify one or more functions capable of being performed by the first connected device 132 and that the user is referring to. Function data 136 can additionally or alternatively be stored locally relative to the server device 112.

In response to determining that the text corresponds to one or more functions, the automated assistant 118, or the client automated assistant 104, can cause the first connected device 132 to perform the one or more identified functions. For example, when the first connected device 132 is an IoT device, such as, but not limited to, an oven, a function can include pre-heating the oven to a particular temperature. A command for causing the function to be performed at the first connected device 132 can be provided from the server device 112 to the first connected device 132. Alternatively, the command for causing the first connected device 132 to perform the function can be provided from the server device 112 to the connected device server 134, which can communicate with the first connected device 132 to cause the first connected device 132 to perform the function. Alternatively, the client device 102 can communicate the command to the first connected device 132, or the server device 112 can communicate the command to the client device 102, which can communicate with the first connected device 132 to cause the first connected device 132 to perform the function.

In response to receiving the spoken utterance 130 to cause the first connected device 132 to perform the function, the automated assistant can cause a composite graphical assistant interface to be provided at a display device (e.g., a tablet device 150, television, laptop, cellphone) associated with the client device 102. In order to generate the composite graphical assistant interface, the automated assistant 118 can identify, from the function data 136, one or more other functions capable of being performed by the first connected device 132. In some implementations, in response to determining that the spoken utterance 130 was received at the client device 102, the automated assistant can determine one or more other functions capable of being performed by a second connected device 146 (e.g., a different IoT device), the tablet device 150, and/or any other device that is associated with the client device 102. For instance, the function data 136 can include one or more lists of functions that corresponds to devices associated with the user 120, the client device 102, and/or the first connected device 132. The automated assistant can determine the one or more functions in order to provide suggestions or recommendations for other functions that the user 120 might want to be performed, at least in view of their request to cause the first connected device 132 to perform a function.

For instance, in some implementations, a profile for a user can be identified based on voice characteristic of a spoken utterance received at the assistant interface 110. The profile can indicate preferences associated with each of a plurality of particular connected devices that are associated with a user, such as preferences learned based on historical interactions of the user with the connected device(s). Furthermore, the profile can identify a group of connected devices that the user 120 prefers to interact with simultaneously, sequentially, and/or within a particular context or contexts. For instance, the user 120 can request that the automated assistant cause the first connected device 132 to perform a first function and the second connected device 146 to perform a second function, and the profile can identify the first function and the second function as related based on, for example, having occurred one or more times in temporal proximity to one another. Thereafter, when the user 120 is interacting with the automated assistant to cause the first connected device 132 to perform the first function, the automated assistant can access the profile to identify the second function based on its stored relation with the first function, and provide the user 120 with a selectable suggestion for causing the second connected device 146 to perform the second function.

In some implementations, historical interactions between the user 120 and the automated assistant can be processed, with permission from the user 120, to determine a historical inclination of the user 120 to interact with particular connected devices in particular contexts. For instance, the user 120 can have a history of interacting with two IoT devices in the morning, therefore the automated assistant can be put on notice that, when the user 120 is interacting with at least one of the two IoT devices in the morning, the user 120 will be inclined to interact with the other IoT device, at least until the morning is over. Additionally, or alternatively, the automated assistant can provide suggestions based on a location of the user 120. For instance, a user 120 can have a history of causing their oven (e.g., the first connected device 132) to perform a first function and their tablet device 150 to perform a second function when the user 120 is in their kitchen 148. Therefore, when the user 120 causes the first connected device 132 to perform the first function, and the user 120 is determined to be in the kitchen 148, the automated assistant can cause a selectable suggestion element to be provided at a composite graphical assistant interface. The selectable suggestion element can operate as a suggestion for the user 120 to cause the second function to be performed at the second connected device 146. The user can be determined to be in the kitchen 148 based on, for example, the client device 102 or tablet device 150 being utilized by the user to perform the first function. “Kitchen” can optionally be identified as an explicit location based on, for example, device 102 and/or device 150 being assigned a kitchen label 148. Alternatively, a location of the user can more generally be a location corresponding to the client device 102 or tablet device 150. In some implementations, the context can be whether the user 120 is participating in a particular event, such as one that is identified in calendar data that is accessible to the automated assistant. In some implementations, the context can be whether one or more other users, different from the user 120, are present at a particular location.

When the automated assistant has identified other functions with which to suggest to the user 120, the identified other functions can be used as a basis from which to identify graphical control elements 140. For instance, a function such as turning on and off a fan can be controlled by a graphical control element 140 corresponding to a two-way switch. Furthermore, a function such as adjusting a temperature of an oven or a volume of music can be controlled using a graphical control element 140 corresponding to a dial, which can be turned to select a value of a range of values. The automated assistant 118 or client automated assistant 104 can map the identified other functions to the graphical control elements 140, which can be stored at or otherwise accessible to the server device 112.

A composite interface engine 144 can map the graphical control elements 140 to the identified other functions in order to generate the composite graphical assistant interface. In some implementations, the automated assistant 118 can access status data 138 corresponding to a status of the first connected device 132. The status data 138 can indicate device statuses, which can be affected by performance or non-performance of one or more functions of the first connected device 132 and/or the second connected device 146. For example, a status of the first connected device 132 can be “off” when no function of the first connected device 132 is being performed. Alternatively, a status can correspond to a table or string of data that indicates a condition(s) or parameter(s) of particular functions of the first connected device 132 and/or the second connected device 146. A status for each of the first connected device 132 and the second connected device 146 can be used as a basis from which to determine configurations for each graphical control element 140 that has been mapped to a function of the first connected device 132 and the second connected device 146. For example, if the first connected device 132 includes heating element that is in an off state, and a function of the first connected device 132 is to turn on the heating element, a graphical control element corresponding to a two-way switch can be provided at the composite graphical assistant interface and configured in a way that shows the heating element being off. In other words, each graphical control element can reflect each current state of the first connected device 132 and the second connected device 146 when the composite graphical assistant interface is presented at the tablet device 150.

In some implementations, an arrangement and/or a selection of the graphical control elements for the composite graphical assistant interface can be based on data associated with interactions between the user 120 and the automated assistant and/or the first connected device 132. For example, a particular group of graphical control elements corresponding to a particular group of functions can be selected for a composite graphical assistant interface based on a user having previously caused such functions to be performed within a threshold period of time. Alternatively, or additionally, the particular group of graphical control elements corresponding to a particular group of functions can be selected for a composite graphical assistant interface based on how a function affects a particular user, a particular location, particular device(s), and/or any other relationship that can be exhibited by functions of a device. For example, a composite graphical assistant interface generated for a user can include graphical control elements corresponding to functions of multiple different connected devices that affect a home in which the user 120 is currently located. Therefore, the selection of the graphical control elements can be based on the location of the user, an identifier for the connected device, a calendar schedule of the user, and/or any other data that can be used to estimate geolocation data for the user.

In some implementations, the user 120 can provide a spoken utterance 130 to an assistant device 128 that lacks an integrated display panel or otherwise cannot readily present a composite graphical assistant interface at the assistant device 128. In such implementations, the automated assistant can receive the spoken utterance 130 at the assistant device 128 and cause a function requested by the user 120 to be performed by the first connected device 132. However, in response to receiving the request to perform the function and determining that a display panel is not available at the assistant device 128, the automated assistant can identify one or more candidate devices. The candidate devices can be identified based on their ability to display an assistant interface, their proximity to the user 120, their proximity to the assistant device 128, their proximity to the first connected device 132, and/or a preference of the user 120 for a particular candidate device. For instance, when the automated assistant determines that a display panel is not available at the assistant device 128 and determines that a tablet device 150 or television is the next most proximate display device to the assistant device 128, the automated assistant can cause the tablet device 150 or the television to present the composite graphical assistant interface. In this way, because the user 120 may not always be near a display panel when they are requesting a particular function be performed by a connected device, the automated assistant can still cause the composite graphical assistant interface to be generated at a nearby display device. In some implementations, the automated assistant can cause the assistant device 128 to audibly notify the user 120 of the location of the composite graphical assistant interface in response to the user 120 providing the request for the connected device 132 to perform a function. For instance, in response to the user 120 providing the spoken utterance, “Assistant, please pre-heat my oven,” the automated assistant can cause the assistant device 128 to audibly output the response, “Ok, I've provided an interface with additional controls at your tablet device.” The automated assistant can therefore put the user 120 on notice that they can find additional controls for the first connected device 132 and the second connected device 146 if they go retrieve their tablet device.

FIGS. 2A and 2B illustrate views 200 and 210 of a composite graphical assistant interface 222 being provided with multiple different graphical control elements for controlling multiple different connected devices. The composite graphical assistant interface 222 can be provided to a user 202 in response to the user 202 providing a spoken utterance 208 to an automated assistant via an automated assistant interface 214. The automated assistant interface 214 can be, for example, at least a microphone capable of providing a signal in response to a user 202 providing an audible spoken utterance 208 to the automated assistant interface 214. For instance, the spoken utterance 208 can be, “Assistant, increase the temperature in my kitchen.” In response to receiving the spoken utterance 208, the automated assistant can determine the function and/or the connected device with which the user 202 is seeking to interact, and can cause the connected device to perform the function the user 202 is requesting be performed. For example, in response to receiving the spoken utterance 208, the automated assistant can cause a thermostat in the home of the user 202 to adjust a temperature of at least the kitchen of the home.

In order to mitigate unnecessarily processing subsequent spoken utterances provided from the user 202, and thereby preserve computational and network resources, the automated assistant can cause function suggestions to be provided at a display panel 206 of a client device 204, or a separate device. For instance, in response to receiving the spoken utterance 208, the automated assistant, the client device 204, and/or a remote server device, can access data from which to provide recommendations for controlling other functions of other connected devices. A connected device can be a device that is linked to, or otherwise, capable of being in communication with the client device 204 and/or the automated assistant. The data can provide information about a status of each connected device, and the suggestions provided at the composite graphical assistant interface 222 can be based on particular statuses of particular connected devices. For instance, a status of the kitchen lights at the time the user 202 provided the spoken utterance 208 can be “off,” therefore, because the user 202 has expressed interest in changing the environment of the kitchen, the user 202 may also be interested in changing a status of the lights in the kitchen. Specifically, the lights in the kitchen can have non-binary parameters associated with their brightness, therefore, in response to determining the user 202 may also be interested in adjusting the kitchen lights, the automated assistant can select a dial graphical control element, which can act as a kitchen brightness control 218.

In some implementations, the kitchen brightness control 218 can be selected for the composite graphical assistant interface 222 based on the kitchen lights being within a common location as the kitchen, which the user 202 explicitly expressed interest in changing the temperature of. Alternatively, or additionally, the kitchen brightness control 218 can be selected for recommending to the user 202 at the composite graphical assistant interface 222 based on a historical inclination of the user 202 to modify the kitchen temperature and the kitchen lights within a particular context. The historical inclination of the user 202 can be indicated as part of a user profile or preference data that characterizes the inclination based on previous interactions between the user 202 and the automated assistant, and/or the user 202 and one or more connected devices. In some implementations, such interactions can occur within a particular context, such as a particular time 212 (e.g., after 6:00 AM), therefore the automated assistant can generate the composite graphical assistant interface 222 based on the particular context. Alternatively, or additionally, the context can be characterized by one or more statuses of one or more respective connected devices.

Alternatively, or additionally, the automated assistant can identify a request from the user and compare the request to available data (e.g., preference data and/or historical interaction data) to determine one or more functions to suggest to the user. When the automated assistant has identified the one or more functions to suggest, the automated assistant can identify each connected device corresponding to each function of the one or more identified functions. A status of each connected device can then be determined in order to ensure that a suggestion for a function to perform is not redundant, or would result in a status of a connected device not changing. For example, a user can typically turn on their kitchen lights in the morning, and subsequently turn on their coffee maker. In such scenarios, the automated assistant can provide a spoken utterance to initialize the automated assistant to cause the kitchen lights to turn on (e.g., “Assistant, turn on my kitchen lights”). Based on data that indicates the historical interactions wherein the user turns on both the kitchen lights and the coffee maker, the automated assistant can identify the coffeemaker and/or a function for turning on the coffeemaker, in response to receiving the request to turn on the kitchen lights. In order to ensure that a function suggestion is not inconsequential, the automated assistant can compare a current status of the coffeemaker to a status that the user typically desires for the connected device when the user turns on the kitchen lights. If the current status of the coffeemaker reflects the desired status, the automated assistant can omit providing a graphical control element at a composite graphical assistant interface for controlling the coffeemaker. However, if the current status of the coffeemaker does not reflect the desired status (e.g., the coffeemaker is off, and is therefore not exhibiting the desired status), the automated assistant can select a graphical control element for controlling the function that turns the coffeemaker on. In this way, the composite graphical assistant interface can be optimized to provide graphical control elements that will effectuate changes that a user would typically desire, rather than providing suggestions to connected devices that may have already occurred.

For example, the composite graphical assistant interface 222 can include a variety of different graphical control elements that are selected for the composite graphical assistant interface 222 based on a status of the particular connected device to which they correspond. For instance, because the user 202 has solicited the automated assistant to turn the temperature of the kitchen up, the automated assistant can identify other connected devices in the kitchen that the user may have an interest in controlling, such as an oven and/or a coffee maker. Because each of these connected devices are off (as indicated by a configuration of an oven temperature control 216 and a coffee maker control 220), these connected devices can be candidate connected devices for control suggestions. Furthermore, the user 202 can have a history of increasing the temperature of the kitchen, adjusting the lights in the kitchen, pre-heating their oven, and turning on their coffee maker, all within a threshold range of time after 6:00 AM. Despite each of these devices and/or their respective control applications being provided by different third parties, the automated assistant can compile the composite graphical assistant interface 222 in order to allow the user 202 to control each device from a single interface.

In some implementations, as a result of the user 202 selecting one of the graphical control elements at the composite graphical assistant interface 222, a status of a corresponding connected device can be changed. For instance, initially the brightness of the kitchen lights is indicated as “off,” at least based on the kitchen brightness control 218 being at a lowest level. When the user 202 adjusts the kitchen brightness control 218 to a higher level, a status of the kitchen lights can be updated to have an “on” status. Furthermore, the status of the kitchen lights can be updated to reflect a particular slot value generated in response to the adjustment of the graphical control element. For instance, the slot value can be “60%,” indicating that the kitchen brightness control 218 was adjusted up to 60%, therefore the status of the kitchen lights can be updated to be “kitchen_lights: status(lights[on], brightness[60], hue[null]).” Thereafter, any other subsequent suggestions can be based the updated status of the kitchen lights. For instance, because the kitchen lights now have a status of being “on,” the automated assistant can cause the composite graphical assistant interface 222 to provide a graphical control element for modifying the hue of the kitchen lights, at least based on the kitchen lights being on, and the “hue” parameter of the kitchen lights being “null” or in a default setting.

In some implementations, the composite graphical assistant interface 222 can offer a status indication 224 of the function performed in response to the user 202 providing the spoken utterance 208. Furthermore, in response to receiving the spoken utterance 208, the automated assistant can cause the composite graphical assistant interface 222 to provide a suggestion that is embodied as natural language that characterizes a spoken utterance. The spoken utterance can be one that, when provided to the automated assistant interface, initializes the automated assistant to cause one or more connected devices to perform one or more functions. For instance, in response to receiving the spoken utterance 208, the automated assistant can identify a status of a connected device located within an area affected by the spoken utterance 208 (e.g., the kitchen). The automated assistant can then identify a natural language input that, when received at the automated assistant interface, would initialize the automated assistant to cause the connected device to perform a function. For example, when the connected device is a coffeemaker, the automated assistant can identify a previously employed spoken utterance for controlling the coffeemaker such as, “Assistant, turn on the coffeemaker.” In response to identifying the aforementioned spoken utterance, the automated assistant can cause the graphical assistant interface 222 to present the spoken utterance as a selectable suggestion element that includes at least the text “turn on the coffeemaker.” In this way, the user 202 can either select the selectable suggestion element to turn on the coffeemaker, or speak the text as a spoken utterance in order to initialize the automated assistant to cause the coffeemaker to turn on. The automated assistant can also retrieve command data that can be used by the client device 204 to interact with a particular connected device. In this way, as the automated assistant decides on the particular suggestions to make to the user 202, the automated assistant can also locally provide data that will allow a suggested function to be executed by a connected device. As a result, the automated assistant and/or the client device will not necessarily have to communicate with a remote server, in response to a user selection of a suggestion, to get additional information about how to cause the connected device to perform the function.

FIGS. 3A and 3B illustrate views 300 and 344 of a user 302 being provided suggestions for controlling one or more connected devices based on an interaction between the user 302 and an automated assistant, and/or a current status of one or more connected devices. For instance, the user 302 can walk into their kitchen 320 and initialize a client automated assistant 330 via an assistant interface 332 of a client device 328 (e.g., an assistant device 326). The user 302 can initialize the client automated assistant 330 by providing a spoken utterance 322 such as, “Assistant, play music in the kitchen.” The client automated assistant 330 can receive the spoken utterance 322 and cause the spoken utterance 322 to be processed at the client device 328, or a separate remote server, in order to identify an action or function the user 302 wishes to be performed. Because the client device 328 does not include a display panel, the client device 328 can access a device topology to determine a suitable device through which to provide a composite graphical assistant interface 306 from which to further control any functions associated with the requested function (i.e., playing music), and/or any other functions associated with other connected devices identified in the device topology.

For instance, a stored device topology accessible to the client automated assistant 330 can identify a tablet device 316, a first connected device 318, and a second connected device 324 within a device topology that identify devices in the kitchen 320. The device topology can further identify the tablet device 316 as being the only device with a display panel, therefore any composite graphical assistant interface to be presented to the user 302 while they are in the kitchen 320 can be provided at the tablet device 316. Furthermore, in order to generate a suitable composite graphical assistant interface, the automated assistant can identify a status of each connected device and/or determine a historical inclination of the user 302 to interact with one of the connected devices after the requested function (i.e., playing music) is performed. For instance, because a current status of the first connected device 318 (e.g., an oven) is off, the automated assistant can suggest that the user control the oven by presenting a selectable suggestion element 310 at the composite graphical assistant interface 306. Furthermore, because a current status of a second connected device 324 (e.g., a dishwasher) is off, the automated assistant can suggest that the user control the dishwasher by presenting a selectable suggestion element 312 at the composite graphical assistant interface 306. Moreover, the automated assistant can determine whether there are any other functions associated with the initially requested function of playing music and provide a graphical control element at the composite graphical assistant interface 306 for controlling the other function. For instance, the function of turning on music, which can be performed by the client automated assistant 330 at the client automated assistant 330, can cause a status of the client device to be at least, “client_device: status(music[on], volume[default]).” In response to a parameter of the client device 328 being in default mode, the automated assistant can provide a graphical control element for controlling the corresponding parameter. For instance, the automated assistant can provide a kitchen volume control 314, in order to control the volume of the music in the kitchen 320. However, if the user 302 did not provide the previous request corresponding to turning on music, the automated assistant can omit the kitchen volume control 314 from the composite graphical assistant interface 306.

FIG. 3B can illustrate a view 344 of a different composite graphical assistant interface 340 provided by the automated assistant in response to the user 302 selecting a selectable suggestion element at the composite graphical assistant interface 306 of FIG. 3A. For instance, in response to the user 302 selecting the “control oven” selectable suggestion element 310, the automated assistant can provide a different composite graphical assistant interface 340 that includes a dial 334 for setting an oven temperature. An initial configuration of the dial 334 can be at “0” in order to reflect a current status of the first connected device 132, and/or one or more current parameters of the first connected device 132. For instance, a current status of the first connected device 132 can be, “oven: status(power[off], temperature[0]),” therefore the configuration of the dial 334 can reflect multiple current parameters of the first connected device 318. In other words, the dial 334 being at its lowest state indicates that the oven is off and hence, the temperature is set to 0. Such parameters can be consolidated and reflected at a single graphical control element in order to preserve space in the composite graphical assistant interface 340, and provide a more intuitive modality through which to provide more information to the user 302. If the user 302 had selected the “control dishwasher” selectable suggestion element 312, a switch 336 can be presented at the different composite graphical assistant interface 340, based on the particular second connected device 324 having a current status of “off,” and a single parameter to control (on or off). In some implementations, metadata corresponding to the requested function, or a status of the requested function (e.g., play music) can be presented as a status indication 342. In this way, the user 302 does not necessarily have to invoke the automated assistant again to determine a current status associated with their request, nor does the user 302 have to swap between different interfaces in order to determine such information. Rather, the automated assistant metadata can be presented with graphical control elements associated with other third party devices, in order to provide a consolidated interface that will reduce the wasting of computational and network resources.

In some implementations, because a current status of another connected device in the kitchen indicate that parameters associated with the other connected device have been set, the automated assistant can omit providing a graphical control element for that connected device. For instance, if a current status of the kitchen lights is “on” when the user 302 provides the spoken utterance 322, the automated assistant can acknowledge the current status of the kitchen lights, and omit suggesting a function for turning off the lights. Such omissions can be based on one or more statuses of one or more devices in the kitchen or other areas of the home, and/or a historical inclination of the user 302 to not turn off their lights when they request music to be played in the kitchen. In some implementations, identifying the historical inclination can be based on a voice characteristic of the user 302, which allows the automated assistant to identify data associated with a specific user to be processed in order to make suitable suggestions for the user or users. In some implementations, the inclusion or omission of a particular graphical control element can be based on a particular context with which the user provided the spoken utterance 322, such as after or before a particular time 304, or within a threshold time period relative to a particular time, a geographic location, an event, and/or a presence of one or more users.

FIGS. 4A and 4B illustrate methods 400 and 410 for providing a composite graphical assistant interface that includes suggestions for functions capable of being performed by multiple different connected devices. The method 410 can be a continuation of method 400 as indicated by the continuation element “A,” encircled at FIG. 4A and FIG. 4B. The method 400 and 410 can be performed by one or more computing devices, applications, and/or any other apparatus or module capable of interacting with a connected device or automated assistant. The method 400 can include an operation 402 of determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for a connected device to perform function. The automated assistant interface can be one or more of a microphone, touch display panel, camera, sensor, and/or any other apparatus through which a user can provide input to a device. Furthermore, the client device can be a computing device that can include a touch display panel that is integral to the computing device, or is otherwise in communication with the computing device. For instance, the client device can be an assistant device that is connected to a WiFi network, and the assistant device can be in communication with a separate display panel (e.g., a television or other display device) via the WiFi network. Any device with which the client device is in communications can be a connected device. For instance, any IoT device, in a home that includes a WiFi network with which a client device is connected, can be considered a connected device. Alternatively, or additionally, any device that the client device is in communications with over a local area network, a wide area network, and/or the internet can be a connected device. An automated assistant can communicate with a connected device by receiving commands from a user via an automated assistant interface and causing a client device to interact with the connected device. Alternatively, or additionally, in response to the automated assistant receiving a command, the automated assistant can cause a remote server to issue a command to the connected device. Alternatively, or additionally, the automated assistant can cause a remote server to issue a command to a connected device server that is associated with the connected device, in order that the connected device server will cause the connected device to perform a particular function.

The method 400 can further include an operation 404 of identifying, in response to receiving the request, a function that corresponds to the received request. The function can be identified by accessing a storage or list of available functions capable of being performed by one or more connected devices that in communication with the client device. In some implementations, a remote server device that at least partially hosts the automated assistant can query a separate connected device server, which can provide data that identifies one or more functions capable of being performed by the connected device. When the connected device is an IoT device, the function can be any executable operation capable of being performed by the connected device such as, but not limited to, providing information from a sensor, changing an input parameter for a hardware component of the connected device, causing the connected device to perform a network operation, causing the connected device to communicate another connected device, and/or any other function capable of being performed by a device. For example, the function, when executed by the connected device, can cause the connected device to turn off any lights controllable by the connected device (e.g., “function(connected_device: lights:{off})”, where “off” is a slot value of a parameter “lights”).

The method 400 can also include an operation 406 of causing, based on identifying the functions, the connected device to perform the function corresponding to the received request. The client device or automated assistant can cause the connected device to perform the function by transmitting command data to the connected device, or causing the command data to be transmitted to the connected device. As a result of the connected device performing the function, a status of the connected device can be modified in order to reflect any change effectuated by execution of the connected device. The status of the connected device can be characterized by data such as, “connected_device: status(lights[off], brightness[null], hue[null]),” where particular parameters or slot values can be empty or null as a result of particular slot values (e.g., “off”) provided to the connected device. In some implementations, depending on a context in which the function was executed by the connected device, a profile associated with a user, or the context, can be updated or otherwise modified to reflect the request for the function to be performed. In this way, data corresponding to historical interactions between the user and the connected device and/or the automated assistant can be subsequently processed in order to enhance future interactions between the user and the connected device and/or the automated assistant. For instance, when the function corresponds to a request to turn off the lights of a device at night, the profile can be updated to rank the particular function higher at night, and/or otherwise indicate a historical inclination of the user to request the function be executed at night. Thereafter, the profile can be used as a basis from which to provide recommendations for functions to be executed by various different connected devices, at least according to a historical inclination of the user to request those, or similar, functions within a particular context. In some implementations, the profile can indicate functions that should be suggested according to a status of one or more connected devices. In this way, the automated assistant can check one or more statuses of one or more connected devices, and determine a historical inclination of the user to request particular functions based on the one or more statuses of the one or more connected devices.

The method 400 can further include an operation 408 of determining, by an automated assistant in response to receiving the request, another function for recommending to the user based on data accessible to an automated assistant. The other function can be different than the function performed at operation 406 and can be performed by a different connected device than the connected device that performed the function at operation 406. The other function can be determined based on data that is accessible to the automated assistant and indicates a historical inclination of the user to cause the other function to be performed, or interact with the other connected device, in the current context. For instance, the other function can be determined based on a current status of the connected device that performed the function at operation 406. Therefore, when the current status of the connected device is “connected_device: status(lights[off], brightness[null], hue[null]),” the suggested other function can be to turn on a security alarm of a security system (e.g., the other connected device), at least based on a previous interaction when the user turned off their lights—causing the status of the lights to change—and turned on their security system. By providing such a suggestion, the computational resources and network resources can be preserved, compared to the user issuing voice commands to cause such functions to be performed, thereby resulting in speech processing.

The method 400 can proceed to method 410, as indicated by continuation element “A,” encircled in FIG. 4A and FIG. 4B. The method 410 can include an operation 412 of identifying, based on determining the other function, a graphical control element for controlling the other function of the other connected device. The graphical control element can be selected from multiple graphical control elements that are stored in a memory that is accessible to the client device and/or the automated assistant. In some implementations, each graphical control element can be stored in association with metadata that indicates inputs capable of being characterized by the graphical control element and/or outputs capable of being provided in response to a selection or gesture received by the graphical control element. For instance, a graphical control element representing a two-way switch can be stored in association with metadata that indicates the graphical control element can receive a touch input and/or a swipe gesture, and/or the graphical control element can provide a binary output such as 1 or 0, and/or on or off. The memory can store a variety of different graphical control element, each capable of providing two or more different outputs, and/or receiving a variety of different inputs.

The method 410 can also include an operation 414 of causing, in response to receiving the request, a composite graphical assistant interface to be provided at a display device associated with the client device. The composite graphical assistant interface can be a graphical user interface with one or more selectable elements capable of causing a particular function to be performed at a connected device when a selection and/or gesture is received at the composite graphical assistant interface. The composite graphical assistant interface can act as a modality through which a user can interact with the automated assistant and/or multiple different connected devices, despite the connected devices being associated with other different third party applications for controlling them. In this way, a user does not necessarily have to download and/or access multiple different applications to control their respective IoT devices but, rather, can simply cause a composite graphical assistant interface to be provided at a display panel. Furthermore, the composite graphical assistant interface can be adapted by the automated assistant according to one or more statuses of one or more respective connected devices, and/or a context in which the user is interacting with the automated assistant and/or a client device. In some implementations, a composite graphical assistant interface can be associated with data that includes slot values for particular graphical control elements presented at the composite graphical assistant interface. The slot values can be provided to corresponding functions, which can then be performed by connected devices that correspond to any respective graphical control elements.

The method 410 can further include an operation 416 of determining that a selection of the graphical control element was received at the composite graphical assistant interface. The selection of the graphical control element can be determined based on data that is transmitted by a display panel, which provides the composite graphical assistant interface, in response to a user touching or otherwise providing a gesture at the display panel. Any slot value corresponding to the determined selection of the graphical control element can then be provided to a device that is responsible for further causing the function to be performed at a respective connected device, such as the connected device itself or a connected device server.

The method 410 can also include an operation 418 of causing, in response to determining the selection of the graphical control element was received, the other connected device to perform the other function. The other function can be, for example, turning on a security system of a home of the user. The other function can be suggested to the user, at least based on a historical inclination of the user to manually turn off their lights and turn on their security system. The other function can also be suggested based on a current status of the lights (e.g., the lights being off). Alternatively, or additionally, the other function can be suggested based on the automated assistant determining that the current status of the lights has been requested to change to “off” and/or that a current context in which the request was made corresponds to a time of day, a geographic location for the user, a status of one or more other devices, and/or any other information that can be used to cause a computer function to be performed. In response to the other function being performed, a current status can be updated at a table or other data storage that includes identifiers for statuses of various connected devices. In this way, the automated assistant can generate additional from an updated combination of statuses and as the statuses change over time. For instance, now that the security alarm is on and the lights are off, the automated assistant can determine a graphical control element for controlling a function of a communications device, in order to allow the user to place the communications device in a silent mode by interacting with the graphical control element at a new composite graphical assistant interface.

FIG. 5 illustrates a method 500 for supplanting a composite graphical assistant interface with graphical control elements for controlling a connected device. The method 500 can be performed by one or more computing devices, applications, and/or any other apparatus or module capable of interacting with a connected device and/or an automated assistant. The method 500 can include an operation 502 of determining that a user has selected a graphical control element at a first composite graphical assistant interface provided at a display panel associated with a client device. The display panel can be connected to the client device as an integral portion of the client device, or as a device that is in communication with the client device over a network. The first composite graphical assistant interface can be presented at the display panel in order to provide the user with a modality through which the user can control a variety of different connected devices, despite the connected devices being manufactured from the same or different entities. Therefore, the first composite graphical assistant interface can include multiple different graphical control elements through which the user can control various different connected devices.

The method 500 can also include an operation 504 of causing, in response to determining that the user has selected the graphical control element, a connected device to perform a function that corresponds to the selected graphical control element. The connected device can be caused to perform the function via the client device and/or the automated assistant. For instance, in response to the first composite graphical assistant interface receiving the selection of the graphical control element, the client device and/or the automated assistant can transmit data to the connected device to cause the connected device to perform the function. Alternatively, or additionally, the client device can communicate data to a remote server, and the remote server can communicate with the connected device to cause the connected device to perform the function.

The method 500 can further include an operation 506 of determining a user preference that characterizes a historical inclination of the user to control another function of another connected device in a context associated with the connected device. The user preference can be data that indicates an inclination or preference of the user to control particular functions of one or more connected devices within a particular context. For instance, the context can correspond to a status or statuses of one or more respective devices, a time of day, gesture or gaze of a user, a scheduled event, a correspondence to a previous interaction between the user and an automated assistant, and/or any other data that can be used as a basis from which to suggest functions for a user to control. In some implementations, the user preference or other data can rank functions according to their popularity or frequency that a particular user controls the functions within a particular context. In order to provide the rankings for functions, a learning model can be employed to receive inputs related to the functions performed, and output data corresponding to rankings for functions that should be suggested to the user. The learning model can be trained based on data that characterizes an interaction between the automated assistant and the user, and/or the user and one or more connected devices.

The method 500 can also include an operation 508 of identifying, based on determining the user preference, the other function of the other connected device. The user preference data can identify the other function, multiple different connected devices, one or more parameters and/or slot values corresponding to the other function, slot values that have been previously provided for the other function in particular contexts, and/or any other data that can be useful of executing a particular function.

The method 500 can also include an operation 510 of determining, based on identifying the other function, another graphical control element for controlling the other function of the other connected device. The other graphical control element can be selected from a memory that is accessible to the client device and/or the automated assistant, and can be associated with metadata that provides at least some correspondence to the other identified function. For instance, the function can be associated with a parameters can receive two or more different slot values, as opposed to a binary parameter, which can only take two. As a result, a graphical control element that would selected for controlling the graphical control element can be one that is capable of providing two or more different valued outputs, such as a dial or a scroll bar.

The method 500 can further include an operation 512 of causing the display panel associated with the client device to supplant the first composite graphical assistant interface with a second composite graphical assistant interface, which includes the other graphical control element. The second composite graphical assistant interface can replace the first composite graphical assistant interface in order to provide further suggestions for the user to control particular connected devices within a particular context. In some implementations, the first composite graphical assistant interface can correspond to a single connected device provided by a third party, and the second composite graphical assistant interface can correspond to multiple different connected devices provided by multiple different third parties. In this way, a user does not necessarily need to install and/or open a variety of different applications to control different connected devices, but, rather, can interact with a single composite graphical assistant interface in order to control a variety of different connected devices.

FIG. 6 is a block diagram of an example computer system 610. Computer system 610 typically includes at least one processor 614 which communicates with a number of peripheral devices via bus subsystem 612. These peripheral devices may include a storage subsystem 624, including, for example, a memory 625 and a file storage subsystem 626, user interface output devices 620, user interface input devices 622, and a network interface subsystem 616. The input and output devices allow user interaction with computer system 610. Network interface subsystem 616 provides an interface to outside networks and is coupled to corresponding interface devices in other computer systems.

User interface input devices 622 may include a keyboard, pointing devices such as a mouse, trackball, touchpad, or graphics tablet, a scanner, a touchscreen incorporated into the display, audio input devices such as voice recognition systems, microphones, and/or other types of input devices. In general, use of the term “input device” is intended to include all possible types of devices and ways to input information into computer system 610 or onto a communication network.

User interface output devices 620 may include a display subsystem, a printer, a fax machine, or non-visual displays such as audio output devices. The display subsystem may include a cathode ray tube (CRT), a flat-panel device such as a liquid crystal display (LCD), a projection device, or some other mechanism for creating a visible image. The display subsystem may also provide non-visual display such as via audio output devices. In general, use of the term “output device” is intended to include all possible types of devices and ways to output information from computer system 610 to the user or to another machine or computer system.

Storage subsystem 624 stores programming and data constructs that provide the functionality of some or all of the modules described herein. For example, the storage subsystem 624 may include the logic to perform selected aspects of method 400, method 410, method 500, and/or to implement one or more of connected device server 134, server device 112, automated assistant 118, client device 102, first connected device 132, second connected device 146, function identifying engine 142, composite interface engine 144, client device 204, client device 328, assistant device 326, and/or any other operation or apparatus discussed herein.

These software modules are generally executed by processor 614 alone or in combination with other processors. Memory 625 used in the storage subsystem 624 can include a number of memories including a main random access memory (RAM) 630 for storage of instructions and data during program execution and a read only memory (ROM) 632 in which fixed instructions are stored. A file storage subsystem 626 can provide persistent storage for program and data files, and may include a hard disk drive, a floppy disk drive along with associated removable media, a CD-ROM drive, an optical drive, or removable media cartridges. The modules implementing the functionality of certain implementations may be stored by file storage subsystem 626 in the storage subsystem 624, or in other machines accessible by the processor(s) 614.

Bus subsystem 612 provides a mechanism for letting the various components and subsystems of computer system 610 communicate with each other as intended. Although bus subsystem 612 is shown schematically as a single bus, alternative implementations of the bus subsystem may use multiple busses.

Computer system 610 can be of varying types including a workstation, server, computing cluster, blade server, server farm, or any other data processing system or computing device. Due to the ever-changing nature of computers and networks, the description of computer system 610 depicted in FIG. 6 is intended only as a specific example for purposes of illustrating some implementations. Many other configurations of computer system 610 are possible having more or fewer components than the computer system depicted in FIG. 6.

In situations in which the systems described herein collect personal information about users (or as often referred to herein, “participants”), or may make use of personal information, the users may be provided with an opportunity to control whether programs or features collect user information (e.g., information about a user's social network, social actions or activities, profession, a user's preferences, or a user's current geographic location), or to control whether and/or how to receive content from the content server that may be more relevant to the user. Also, certain data may be treated in one or more ways before it is stored or used, so that personal identifiable information is removed. For example, a user's identity may be treated so that no personal identifiable information can be determined for the user, or a user's geographic location may be generalized where geographic location information is obtained (such as to a city, ZIP code, or state level), so that a particular geographic location of a user cannot be determined. Thus, the user may have control over how information is collected about the user and/or used.

While several implementations have been described and illustrated herein, a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein may be utilized, and each of such variations and/or modifications is deemed to be within the scope of the implementations described herein. More generally, all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the teachings is/are used. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific implementations described herein. It is, therefore, to be understood that the foregoing implementations are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, implementations may be practiced otherwise than as specifically described and claimed. Implementations of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the scope of the present disclosure. 

We claim:
 1. A method implemented by one or more processors, the method comprising: determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for an automated assistant to perform a function, wherein the spoken utterance is provided by a user; identifying, in response to determining the request was provided by the user, the function that corresponds to the request provided by the user; causing, based on identifying the function, the automated assistant to perform the function that corresponds to the request provided by the user; determining a first current status of a first connected device that is linked to the client device, wherein the first current status of the connected device is affected by performance or nonperformance of a first function; determining a second current status of a second connected device that is linked to the client device, wherein the second current status of the second connected device is affected by performance or nonperformance of a second function; determining a first graphical control element for controlling the first function of the first connected device and a second graphical control element for controlling the second function of the second connected device; determining, based on the first current status of the first connected device and the second current status of the second connected device, a configuration for each graphical control element of the first graphical control element and the second graphical control element; and causing, in response to determining the request was received, a display panel associated with the client device to provide a composite graphical assistant interface for controlling the first connected device and the second connected device, wherein the composite graphical assistant interface includes the first graphical control element and the second graphical control element, which are arranged according to the determined configuration for each graphical control element.
 2. The method of claim 1, further comprising: accessing a profile for the user based on a voice characteristic associated with the spoken utterance, wherein identifying at least one of the first function and the second function is based on information available from the profile.
 3. The method of claim 2, wherein the information available from the profile indicates a historical inclination of the user to interact with at least one of the first connected device and the second connected device within a particular context.
 4. The method of claim 3, wherein the particular context corresponds to one or more of a status of the first connected device or the second connected device, a geographic location of the user, a time, an event, or a presence of one or more other users.
 5. The method of claim 4, wherein the particular context corresponds to the status of the first connected device and the status of the second connected device.
 6. The method of claim 1, wherein determining the first current status of the first connected device includes requesting, from a remote server device, the first current status.
 7. The method of claim 6, further comprising: receiving a selection of the first graphical control element for controlling the first function; and causing the client device to interact with the first connected device according to command data associated with the first function, wherein the first function is different than the function performed by the automated assistant.
 8. The method of claim 7, further comprising requesting the command data from the remote server device.
 9. The method of claim 1, wherein determining the first graphical control element for controlling the first function is based on determining that the first function is not redundant in view of the first current status of the first connected device.
 10. A system comprising one or more processors and memory operably coupled with the one or more processors, wherein the memory stores instructions that, in response to execution of the instructions by one or more processors, cause the one or more processors to perform the following operations: determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for an automated assistant to perform a function, wherein the spoken utterance is provided by a user; identifying, in response to determining the request was provided by the user, the function that corresponds to the request provided by the user; causing, based on identifying the function, the automated assistant to perform the function that corresponds to the request provided by the user; determining a first current status of a first connected device that is linked to the client device, wherein the first current status of the connected device is affected by performance or nonperformance of a first function; determining a second current status of a second connected device that is linked to the client device, wherein the second current status of the second connected device is affected by performance or nonperformance of a second function; determining a first graphical control element for controlling the first function of the first connected device and a second graphical control element for controlling the second function of the second connected device; determining, based on the first current status of the first connected device and the second current status of the second connected device, a configuration for each graphical control element of the first graphical control element and the second graphical control element; and causing, in response to determining the request was received, a display panel associated with the client device to provide a composite graphical assistant interface for controlling the first connected device and the second connected device, wherein the composite graphical assistant interface includes the first graphical control element and the second graphical control element, which are arranged according to the determined configuration for each graphical control element.
 11. The system of claim 10, wherein the operations further include: accessing a profile for the user based on a voice characteristic associated with the spoken utterance, wherein identifying at least one of the first function and the second function is based on information available from the profile.
 12. The system of claim 11, wherein the information available from the profile indicates a historical inclination of the user to interact with at least one of the first connected device and the second connected device within a particular context.
 13. The system of claim 12, wherein the particular context corresponds to one or more of a status of the first connected device or the second connected device, a geographic location of the user, a time, an event, or a presence of one or more other users.
 14. The system of claim 13, wherein the particular context corresponds to the status of the first connected device and the status of the second connected device.
 15. The system of claim 10, wherein determining the first current status of the first connected device includes requesting, from a remote server device, the first current status.
 16. A non-transitory computer readable storage medium configured to store instructions that, when executed by one or more processors, cause the one or more processors to perform operations that include: determining that a spoken utterance, detected via an automated assistant interface of a client device, includes a request for an automated assistant to perform a function, wherein the spoken utterance is provided by a user; identifying, in response to determining the request was provided by the user, the function that corresponds to the request provided by the user; causing, based on identifying the function, the automated assistant to perform the function that corresponds to the request provided by the user; determining a first current status of a first connected device that is linked to the client device, wherein the first current status of the connected device is affected by performance or nonperformance of a first function; determining a second current status of a second connected device that is linked to the client device, wherein the second current status of the second connected device is affected by performance or nonperformance of a second function; determining a first graphical control element for controlling the first function of the first connected device and a second graphical control element for controlling the second function of the second connected device; determining, based on the first current status of the first connected device and the second current status of the second connected device, a configuration for each graphical control element of the first graphical control element and the second graphical control element; and causing, in response to determining the request was received, a display panel associated with the client device to provide a composite graphical assistant interface for controlling the first connected device and the second connected device, wherein the composite graphical assistant interface includes the first graphical control element and the second graphical control element, which are arranged according to the determined configuration for each graphical control element.
 17. The non-transitory computer readable storage medium of claim 16, wherein the operations further include: accessing a profile for the user based on a voice characteristic associated with the spoken utterance, wherein identifying at least one of the first function and the second function is based on information available from the profile.
 18. The non-transitory computer readable storage medium of claim 17, wherein the information available from the profile indicates a historical inclination of the user to interact with at least one of the first connected device and the second connected device within a particular context.
 19. The non-transitory computer readable storage medium of claim 18, wherein the particular context corresponds to one or more of a status of the first connected device or the second connected device, a geographic location of the user, a time, an event, or a presence of one or more other users.
 20. The non-transitory computer readable storage medium of claim 19, wherein the particular context corresponds to the status of the first connected device and the status of the second connected device. 